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ABSTRACT 

The assumption that the Universe, on sufficiently large scales, is homogeneous and 
isotropic is crucial to our current understanding of cosmology. In this paper we test 
if the observed galaxy distribution is actually homogeneous on large scales. We have 
carried out a multifractal analysis of the galaxy distribution in a volume limited sub- 
sample from the SDSS DR6. This considers the scaling properties of different moments 
of galaxy number counts in spheres of varying radius r centered on galaxies. This anal- 
ysis gives the spectrum of generalized dimension D q (r), where q > quantifies the 
scaling properties in overdense regions and q < in underdense regions. We expect 
D q (r) = 3 for a homogeneous, random point distribution. 

In our analysis we have determined D q (r) in the range —4 < q < 4 and 7 < r < 
98/i _1 Mpc. In addition to the SDSS data we have analysed several random samples 
which are homogeneous by construction. Simulated galaxy samples generated from 
dark matter N-body simulations and the Millennium Run were also analysed. The 
SDSS data is considered to be homogeneous if the measured D q is consistent with that 
of the random samples. We find that the galaxy distribution becomes homogeneous 
at a length-scale between 60 and 70/i _1 Mpc. The galaxy distribution, we find, is 
homogeneous at length-scales greater than 70/i~ 1 Mpc. This is consistent with earlier 
works which find the transition to homogeneity at around 70/i _1 Mpc. 

Key words: methods: numerical - galaxies: statistics - cosmology: theory - large 
scale structure of the Universe. 



1 INTRODUCTION 



The assumption that the universe is homogeneous and isotropic, known as the Cosmol ogical Principle, is perhaps the most 
fundamental postulate of the currently well accepted standard model of cosmology (e.g. Weinberg 200S). The Sloan Digital 
Sky Survey (SDSS), the largest galaxy survey till date, has mapped out the galaxy distribution in a large volume of space. This 
provides a unique opportunity to observationally test the Cosmological Principle. The two-poin t correlation function and the 
power spectrum, which are the standard techniques to quantify galaxy clustering l|Peebleslll98"ol ). rely on the assumption that 
the Cosmological Principle i s vali d on a sufficiently large scale, and hence cannot be used to test the Cosmological Principle. 



Coleman fc Pietronerd ([1992) have proposed that the universe has a fractal structure. If true, this violates the Cosmologi- 
cal Principle. While some of the subsequent analysis of galaxy surveys show evidence for homogeneity in the galaxy distributio n 



at sufficiently large scales ( Guzzolll997l : lBharadwai et al.fl999l : IPan fc Colesll2000l : lTikhonov et al.ll2000l : lKurokawa et al.ll200ll ). 
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there are others who either fail to find a transition to homogeneity or find definite evidence for a fractal str ucture out to the 



largest scale probed (Hatton 



In a recent study 



1999 



Yadav et al 



Amendola fc Palladinolll999l ; [rBeslboOOl : IfBarvshev fc Bukhmastovall2004h . 
( 20051 )" have analysed nearly two-dimensional (2D) str ips from the SPSS Data Release One 
to find a definite transition to homogeneity occurring at the length-scale 60 — 70 /i~ 1 Mpc. iHogg et al.l (2005) have analysed the 
distribution of Luminous Red Galaxies (L RG) in the SPSS to find a transiti on to homogeneity at ~ 70 h~ 1 Mpc. The transition 
to homogeneity, however, is contested by Svlos Labini et al. 1 2007 . 2009 8.^ who fail to find a transition to homogeneity at 
the largest scales (~ 100 /i _1 Mpc) which they probe in the 2dF and SPSS galaxy surveys. 

In this paper we have analysed the SPSS Pata Release Six (PR6) to test if the galaxy distribution exhibits a transition to 
homogen eity at large scales, and i f so to determin e the scale of homogeneity. To achieve this we have carried out a multifractal 
analysis I Martinez fc Jonea 1990l : Borgani 1995 ) of the galaxy distribution in a volume limited subsample drawn from the 
SPSS PR6. The multifractal analysis quantifies the scaling properties of a number of different moments of the galaxy counts, 
allowing for the possi bility that thi s may differ depending on the density environment. The current work improves upon the 
earlier 2P analysis of lYadav et alj (120051 ). The large contiguous region covered by SPSS PR6 allows us to carry out a full 
three dimensional analysis covering scales upto 103 /i _1 Mpc in the volume limited sample. The transition to homogeneity has 
been identified by comparing the observations with a random point distribution having the same number and geometry as the 
actual galaxy distribution. The observations have also been compared with ACPM N-body simulations which are also used 
to determine the error-bars. 

A brief outline of the paper follows. We describe the data and the method of analysis in the section^] Section presents 
the results and conclusions of our study. 



2 DATA AND METHOD OF ANALYSIS 



2.1 SDSS DR6 Data 



Our present analysis is based on galaxy redshift data from the SDSS DR6 ( Adelman- McCarthy et al. 20081 ). The SDSS DR6 
includes 9583 square degrees of imaging and 7425 square degrees of spectroscopy with 790, 860 galaxy reds hifts. For the 



presen t work we have used the Main Galaxy Sample for which the target selection algorithm is detailed in I Strauss et al 
( 20021 ). The Main Galaxy Sample comprises of galaxies brighter than a limiting r band Petrosian magnitude 17.77. The data 



was downloaded from the Catalog Archive Server (CAS) of SDSS DR6 using a Structured Query Language (SQL) search. 
We have identified a contiguous region in t he Northern Gala c tic C ap which spans —50° < A < 30° and —6° < r\ < 35° where 
A and rj are survey co-ordinates defined in IStoughton et alj ((2002) . A volume limited galaxy subsample was constructed in 
this region by restricting the extinction corrected Petrosian r band apparent magnitude to the range 14.5 < m r < 17.77 
and restricting the absolute magnitude to the range —20 > M r > —21. This gives us 41, 234 galaxies in the redshift range 
0.04 < z < 0.11 which corresponds to comoving radial distances from 130 /i _1 Mpc to 335 /i -1 Mpc. 



2.2 N-body Data 



We have simulated the dark matter distribution at z = using a Particle- Mesh (PM) N-body code. This was used to simulate 
the observed galaxy distribution by identifying randomly chosen dark matter particles as galaxies. Here we have chosen 41, 234 
particles from a region which is exactly identical to the survey volume. 

The N-body simulations use 512 3 particles on a 512 3 mesh, with grid spacing 1 ft _1 Mpc. The si mulations use a ACDM 
power spectrum with cosmological parameters (fl m o, RiYo, h, n s , a&) = (0.274,0.726,0.705,0.96,0.812) 1 Komatsu et al. 20091 ) . 
The effect of peculiar velocities was incorporated using the plane parallel approximation. We have run five independent 
realizations of the N-body code, and the simulated galaxy distribution were analysed in exactly the same way as the actual 
data. 

The galaxies in our simulations are expected to exactly trace the dark matter distribution. Galaxy formation is a complex 
non-linear process, and it is quite possible that the galaxies are a biased tracer of the u nderlying dark matt er distribution. To 
account for this possibility, we have a lso considered a semi analytic galaxy catalogue ( Croton et al. 2006! ) from Millennium 
Run Simulation l|Springel et al.1 120051 ). Semi analytic models are simplified simulations of the formation and evolution of 
galaxies in a hierarchical clustering scenario incorporating all relevant physics of galax y formation processe s . The spectra and 
magnitude of the model galaxies were computed using population synthesis models of iBruzual fc Charloti (|2003l ) and we use 
the catalog where the galaxy magnitudes are available in the SDSS u, g, r, i, z filters. The catalog contains about 9 million 
galaxies in a (500 /i _1 Mpc) 3 box. Using the peculiar velocities we map the galaxies to redshift space and then identify a region 
having same geometry and choose the same number of galaxies within the specified magnitude range as the actual galaxy 
sample. 
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2.3 Method of analysis 

Scale invariance of galaxy clustering ( Jones et al. 20051 ). atleast at small scales, motivates us to use fractal analysis. There 
are various definitions of fractal dimension e.g. the Hausdorff Dimension, Capacity Dimension, Similarity Dimension, Box 
counting Dimension, etc. These definitions represent particular cases of the multifractal spectrum of generalised dimension. 
In this paper, we use the generalised Minkowski-Bouligand dimension to characterise the galaxy distribution. This contains 
complete information about various moments of the galaxy counts, and hence, is well suited for the multifractal analysis. 

Considering the i th galaxy as center, we determine rii(< r) the number of other galaxies within a sphere of comoving 
radius r. The generalised correlation integral is defined as 



M 
i=l 



(1) 



where M is the number of centers, N is the total number of galaxies and q — 1 refers to a particular moment of the galaxy 
counts. For a fixed r, we have used all the galaxies, barring those that lie within a distance r from the survey boundary, as 
centers. In our analysis N = 41234 and M = 27706 and 6108 for r = 20 and 70 h" 1 Mpc respectively. 

The generalised Minkowski-Bouligand dimension D q follows from the correlation integral, and is given by 

1 d logC q (r) 



D q (r) 



1 d log r 



(2) 



For q > 1, D q probes the scaling behaviour of galaxies in high density environments e.g. clusters and super-clusters and for 
q < 1, D q probes the same in under-dense environments like voids. We have varied q from q — —4 to q — +4 in steps of 1. In 
principle, we could have considered eve n higher and lower values of q but the finiteness of the data increases the scatter in the 
D q value as the value of | q | increases ( Baela et al. 20081 ). We have calculated D q (r) by numerically differentiating C q using 
the Ridders' method I Press et al.lll992f l considering C q (r) at three consecutive r values each separated by 5h~ Mpc. Using 
a smaller r interval (l/i _1 Mpc) , we find an increase in the fluctuations in D q {r) while using a larger interval (10/i _1 Mpc) 
gives nearly the same results as (5/i _1 Mpc). 

We have determined C q (r) in the range 1 < r < 103/i~ 1 Mpc and D q (r) in the range 7 < r < 98/i _1 Mpc for the data 
as well as the simulations. To assess the transition to homogeneity we have also generated "random samples" which contain 
randomly located points. The number of points and the volume coverage of these samples is exactly same as that of SDSS 
data. The point distribution in the random samples are homogeneous by construction, and we have generated 10 independent 
samples. The data, or the simulations, are considered to be homogeneous on length-scales where D q (r) is consistent with that 
of the random sample. 



3 RESULTS AND CONCLUSIONS 

Figure [1] shows the correlation integral C q (r) for q — —2 and q = +3. The behaviour is similar for other values of q. We find 
that C q (r) increases with r for positive values of q whereas it falls progressively for negative values of q. It is quite clear that at 
all length-scales the SDSS galaxies, the dark matter and the Millennium simulations all exhibit the same scaling behaviour. It 
is also quite clear that on small scales, less than approximately 30 and 40fe _1 Mpc for q = —2 and 3 respectively, this scaling 
behaviour is distinctly different from that of the random samples. The scaling behaviour of the actual data, simulations and 
the random sample all appear to match as r — > 100 h^ 1 Mpc, and the C q (r) curves all coincide if plotted on the same scale. 
The match between the data and the random samples indicates a transition to homogeneity at a length-scale smaller than 
100/i _1 Mpc, and the issue is to identify the length-scale where this transition occurs. 

As discussed earlier, we have numerically differentiated C q (r) to calculate the Minkowski-Bouligand dimension D q (r) at 
different value of r. The results are shown in Figure [2] for q = — 2 and +3. The behaviour is similar for other values of q. We 
expect the random samples to have D q = 3, the same as the dimension of the ambient space, irrespective of the value of r. 
We find that D q (r) has values close to 3, the deviations being less than 7%. At length-scales of r ~ 30/i _1 Mpc the data and 
the simulations have D q (r) values that are quite different from the random samples. We find D q ~ 3.2 and 2.6 for q — —2 
and +3 respectively. The difference from the random samples is larger than the 1 — a statistical fluctuations expected for 
the data, It is clear from this figure that the SDSS data is consistent with the random data at length-scales 70/i _1 Mpc and 
larger. Based on this we conclude that the transition to homogeneity occurs at a length-scale between 60 and 70/i _1 Mpc. We 
note that the dark matter N-body simulations and the Millennium simulation also exhibit a transition to homogeneity at a 
similar length-scale as the SDSS galaxies. 

We next consider the full spectrum of generalised dimension D q (r) with —4 < q < 4 at r = 60 and 70 /i _1 Mpc (Figure 
[3|. At 60 ft -1 Mpc the SDSS and random data are consistent for positive q, but they do not agree for q < where D q is larger 
for the SDSS data. This indicates that at 60 h' 1 Mpc the overdense regions are consistent with homogeneity whereas the 
underdense regions are not. At 70 h~ 1 Mpc we find that the SDSS and random data are consistent for the entire q range that 
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Figure 1. This shows C q (r) for q = —2 (left panel) and q = 3 (right). The different curves (top to bottom) show the Millennium Run 
(pink-dotted), dark matter N-body (blue-dashed), SDSS data (red-thick solid) and random data (green-thin solid). The two simulation 
curves have been scaled arbitrarily for convenience of plotting. The four curves shown here completely overlap at large r if plotted with 
the same scale. For the N-body and random data, the mean C q averaged over different independent realisations is shown. 




20 30 40 50 60 70 80 90 100 20 30 40 50 60 70 80 90 100 

r (h~ 1 Mpc) r (h" 1 Mpc) 

Figure 2. This shows D q (r) for q = —2 (left panel) and q = 3 (right). The mean and 1 — a error-bars for the random data and the 
N-body simulations were respectively determined using 10 and 5 independent realisations. We have used the N-body 1 — a error-bars to 
estimate the expected statistical fluctuation in the actual SDSS data, and show these on the SDSS results instead of the mean N-body 
curve. We consider the SDSS data to be consistent with the random data if the random data lies within the 1 — a error-bars of the SDSS. 



we have analysed. This is in keeping with our conclusion that the transition to homogeneity occurs at a length-scale between 
60 and 70/i~ 1 Mpc. 

Our results are in disagreement with similar analyses by Svlos Labini et al. 1 2007 . 2009al lbh who report long range cor- 
relations and persistent fluctuations in the large scale galaxy distribution, and fail to find a transition to homogeneity. The 
results of this paper are in good agreement w ith an earlier 2D analysis of the SDSS main galaxy sample (|Yadav et al.l 12005) 
and the SDSS LRG sample (|Hogg et alj|2005tl both of which find a transition to homogeneity at around 70 h 1 Mpc. Note that 
the LRG sample covers a much larger volume as compared to the main galaxy sample considered in this paper. A visual in- 
spection of the galaxy distribution reveals that the galaxies appear to be distributed in an interconnected network of filaments 
encircling voids. This n etwo rk, referred to as the "Cosmi c Web" , appears to fill the entire region covered by galaxy surveys. 



Bharadwai et al.l (|2004r ) and lPandev fc Bharadwail (|2005f ) have shown that the observed filaments are statistically significant 
only to length-scales of 70/i _1 Mpc and not beyond. Longer filaments, though seen, are produced by chance alignments and 
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Figure 3. This shows D q as a function of q for r = 60 (left panel) and 70/i -1 Mpc (right). We show the SDSS data (red points), random 
data (green solid line), N-body (blue dashed) and Millennium (pink dotted). Five independent N-body realisations were used to estimate 
the 1 — a error-bars shown on the SDSS data. 



are not statistically significant. The good agreement between all of these findings provide strong evidence that the galaxy 
distribution exhibits a transition to homogeneity at around 70/i _1 Mpc. 
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